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(54) Tumour suppressor gene 

(57) A detailed genetic map on human chromosome 
1 1 was prepared. Then, a commonly deleted region on 
the chromosome in the tumor tissues of patients with 
multiple endocrine neoplasia type 1 was identified. Fur- 
ther; by the linkage analysis on a family line with this dis- 
ease, a gene causative of this disease was localized. A 
gene present in the region common to these observa- 
tions was cloned and the structure of this gene was 
determined. Because a protein coded by this DNA is 
homologous with those of transcriptional factors, it is 
expected that the above-mentioned gene may be a 
novel tumor suppressor gene. Further, it is also 
expected that the above-mentioned gene and a protein 
coded for thereby may be useful in preparations of a 
remedy for cancer and a diagnostic drug for cancer. 
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Description 

Background of the Invention 
s field of the Invention 



yss method wherein the above-mentioned gene is used. Thus, they are usable in the field of medicines. 



10 



Description of the Related Art 



artJliL ? ^ 3 09 bme lhat 9ene mutation in 061,8 P' 3 ^ an N>oflant role in the onset of cancer Recent 

cSf a '?r ^T 9 !™ made " P0SSb,e 10 DNAs * analyze gene ,Son 2S 

cer cells and thus contributed to the remarkable development in the field of studies on cancer 

phn^ST' 8 ^ ? enW , k:ation 01 onc °8 enes - *Wch are thought to participate in the cancerization of cells and the 
abnonnal I PnHntan of cancer cells, are now in progress and the number of the oncogenes thus clS ed soS 
amounts to several tens. On the other hand, tumor suppressor genes having a reverse function have been me foc^ 
.ntense research interest in these several years. Examples of the tumor suppressor genes^chtave Sen taSiS 
so far .nclude Rb gene of retinoblastoma [Friend. S.H.. et al., Proc. Natl. Acad. Sd SsTiT to£li£% SSfJ 
20 [Une. D.R. et a... Nature. 278. 261 (1979)] and ARC gene [Kenneth. W.K.. et ZSS^SSfL SlCcSn 
cancer and WT1 gene of Wilms' tumor [Call. KM, et a... Cel.. 60. 509 (1990)]. In me casfoflSpS JZJ I fe k^own 

fZrt^t ;2" ' ^ 1233 (1 " 0): 81X1 Srivastava ' S - e « Nature. 342, 747 (1990)]. i^veTfe 

considered that there are much more unidentified tumor suppressor genes ' ™ 18 

25 Ho*/ ei ^ rine . ne °P' aSia 1 ^ EN1 ) is ^ autosomal dominant hereditary disease characterized bv the 

development of hyperplasia or neoplasm in the endocrine organs such as accessory thyroid IsleteSi^no^ in *1 

genetic defect exists m the long arm of chromosome 1 1 (1 iq). Also there is known a region whfch is deleted Sh Wah 

Accordingly, it is now the focus of world-wide interest of physicians and researches to isolate this tumor Buwmnr 

the tumor suppressor gene in this region. «soiaie 

« m a " w r 0bi6Ct 01 the preS6nt lnvention t0 provide a novel tumor suppressor gene, a transforms transformed bv a 

for bythe ^suppressor gene, an antibody against the polypeptide and methods for stua^^ 

ing and medically treating cancer with the use of them. camming, aiagnos 



40 



Disclosure of the Invention 
Summary of the Invention 



45 



50 



55 



oJS^SS, nTT 8 iS0 ' a i ed C0Smid C ' 0neS COntainin9 3 number of RFLP ™ri<efs on chromosome 1 1 and pre- 

Cs ■ iSKSffiS r 9 deV6,0PPed RFLP m " hB * a re « ion deleled - s^ 

rnL? n ^ • 8 fe9IOn where me tumor suppressor gene existed was restricted to through 

the li nkage ana lysis. As a result, the region common to these observations was specified as 1 1qi3 3aZZs 

was screened Thus a cDNA coding for an amino acid sequence being homologous with transcriptional factors such as 
human Wiln^h^r stressor gene (mi)productan^ 

eJ^TZ^^ reSPOndS ,0 ^ eX ° 9en ° US and -^nousXiii by con^reneSe^Sa S 
MM Sor^rZ^ T* ^ end ° Crine SyStCmS - After bei "9 is transmitted via the 

tTf frl tranSrnrtt " 19 system or enters ^ nuclei and thus acts on a gene or a transcriptional fac- 

tion (^L^nTS^^^ 6 " ^ ^ Ce " S heflln t0 ^ 3 ^ for *° ^ereSSSt- 
a l" (Cance " zatlon) or deam - From the very beginning, the process of the ontogeny and moiphoqenesis of an 

L ^1 I^r^ ^ that nothmg but the coordination in gene expression deperxfing mainly ?n tmSrSS" rrSS 
a living organ.sm as it is and cancer breaks out when this coordination falls into disorder. P 

Therefore, we deemed the done thus isdated as one of tumor suppressor qenes. isolated thP rDMA fho™** i„ »«, 
' en9th rt *• — »-o». As a resuit it has beJproved CfpS^^TL^ 
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cDNA in the full length is an intranuclear transcriptional regulator having a nuclear localizing signal a proline-rich 
domain and a zinc finger motif. 

Thus, the present invention relates to: 

(1) a DNA comprising the full structure or a part of the DNA represented by SEQ ID NO:1 ; 

(2) a polypeptide comprising the full structure or a part of the polypeptide coded for by the DNA represented by 
SEQ ID NO:1 ; 

(3) a transformant transformed by a plasmid having, integrated therein, the full structure or a part of the DNA rep- 
resented by SEQ ID NO:1 which can be expressed therein; 

(4) an antibody against the above-mentioned polypeptide as an antigen; and 

(5) a gene analysis method which comprises using, as a primer, a probe or a marker, a DNA comprising a part of 
the DNA represented by SEQ ID NO:1 and hybridizing the primer, the probe or the marker with a DNA to be tested. 

In other words, the present invention relates to: 

(a) a cDNA which comprises one containing the full or a part of the cDNA of the tumor suppressor qene reore- 
sented by SEQ ID NO:1; y ^ 

(b) a polypeptide which comprises one containing the full or a part of the polypeptide coded for by the cDNA of the 
tumor suppressor gene represented by SEQ ID NO:1 ; 

(c) host cells which are obtained by integrating the full or a part of the cDNA described in SEQ ID NO'1 into a plas- 
mid which can express it and transforming thereby; 

(d) an antibody against the polypeptide described in the above item (b) as an antigen; and 

(e) a gene analysis method characterized by using a DNA containing a part of the DNA sequence described in the 
above item (a) as a primer, a probe or a marker. 

With respect to the DNAs and polypeptides, those which are substantially equivalent to the DNAs and polypeptides 
described above are also included in the scope of the present invention. The expression "DNAs and polypeptides being 
substantially equivalent- means those which have been modified via, for example, deletion, replacement addition or 
insertion of the constituting bases or constituting amino acids and derivatives thereof, which exhibit the same effects as 
Yf 8 ^ ° n9,nal ° NAS ^ P 01 ^^ 5 - However, the extent of these effects is irrelevant thereto. The term "a part 
of the DNA- means a fragment composed of at least 10 bases derived from the DNA. In order to employ as a primer 
for example, a DNA fragment having a base sequence generally consisting of 1 0 to 30 bases, preferably 1 5 to 25 bases 1 
is selected. In order to employ as a probe, a DNA fragment having a base sequence generally consisting of at least 15 
bases, preferably at least 20 bases, is selected. 

The term "a part of the polypeptide" means a peptide having a sequence composed of at least 6 amino acid resi- 
dues derived from the polypeptide. When a part of a polypeptide is to be used as an antigen for the preparation of an 
antibody or as an epitope for the detection of an antibody, it is known that a peptide having a sequence consisting of 6 
amino acid residues would bind to an antibody [see WO 8403564, published on Sep. 13. 1984 (Assignee: COMMON- 
WEALTH SERUM UBS and GEYSEN. H. M.)J. A peptide having a sequence generally consisting of at least 10 amino 
acid residues, preferably at least 20 amino acid residues, is employed therefor. Although it may be anticipated that a 
peptide having a sequence consisting of 6 amino acid residues can achieve only a poor efficiency in the production of 
an antibody, such a peptide is also usable in the form of a fused peptide. 

Furthermore, an RNA which comprises one translated from the DNA represented by SEQ ID NO:1 or a part of the 
same and RNAs which are substantially equivalent thereto are included in the scope of the present invention 

Now the present invention will be described in greater detail. 
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Detailed Description of the Invention 
(1) Isolation of cDNA 
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The target cosmid library of the human chromosome 11 can be prepared in. for example, the following manner 
From human/mouse hybrid cell line containing a single human chromosome 1 1 in a mouse genomic background a 
chromosomal DMA is extracted. Then DNA fragments of the chromosomal DNA can be integrated into a vector such as 
pWE15 by a conventional method (Maniatis, T, et al.. Molecular Cloning 2nd. ed.. Cold Spring Harbor laboratory 
Press, N.Y. (1989)J. Clones having an insert originating in the human chromosome can be screened by the colony 
hybridization with the use of a whole human DNA as probe. The thus obtained cosmid clones containing a DNA origi- 
nating in the human chromosome 11 are then subjected to the fluorescent in-srtu hybridization (FISH) method [Taka- 
hash. et al.. Am J. Hum Genet, SS. 14 - 16 (1990)]. Thus, each of the multitude of the cosmid clones can be localized 
throughout the chromosome and a physical chromosomal map can be prepared. Further. RFLP markers can be iso- 
lated on the basis of the fragment length pattern which has been prepared by cleaving human DNA with several restric- 
ts enzymes INakamura et al.. Am J. Hum. Genet. 42, 854 • 859 (1988)]. Among these clones, those located around 
the region of 1 1q13 are subjected to the FISH method and the linkage analysis to thereby give a further detailed genetic 
map. Based on this map, the DNA of a cancer tissue of a patient is examined in the loss of heterozygosity (LOH) by 
utilizing the RFLP. Thus the region where the target tumor suppressor gene is located can be further restricted to 

Rom the cosmid clones existing in the region which has been thus restricted to a DNA fragment being under 
expression can be isolated by the exon trapping method [Buckler. A. et al.. Proc. Natl. Acad. Sci. USA. 88, 4005 - 4009 
(1991)]. By using the DNA fragment thus obtained as probe, the cDNA of a gene existing in the restricted region near 
q13 of human chromosome 1 1 can be cloned. 

(2) Confirmation of the whole structure of the gene 

The base sequence of the cDNA can be determined by the Maxam-Gilbert method [Maxam, A.M. and Gilbert. W. 
Proc. Natl. Acad. Sci. USA. 74, 560(1977)] or thedideoxy technique [Messing. J.. Nucleic acid Res. 9 309 (1981)1 "■ 
nK,« be ~ nfi " med ** ex*™? 1 * 5,RAC E method, the 3'RACE method or the Northern blotting that the 
cDNA obtained by the above-mentioned method contains the full length protein translation region. 

(3) Recombinant expression vectors and transformants transformed thereby 

The tumor suppressor gene cDNA obtained by the above-mentioned method, or a fragment thereof is integrated 
into an appropriate vector and then this vector is introduced into appropriate host cells to obtain a transformant By cul- 
turing this transformant in a conventional manner, a large amount of the tumor suppressor gene product or a fragment 
thereof can be obtained from the culture. More specifically, the cDNA is linked to the downstream side of the promoter 
of a vector suitable for the expression of the cDNA by a known method with the use of restriction enzymes and DNA 
ligase. Thus a recombinant expression vector can be constructed. Examples of the vectors usable therefor include plas- 
mids PRB322 and pUC18 originating in Escherichia ggB, plasmid pUBl 10 originating in Bacillus suhtife plasmidpRB15 
ong.nating in yeast, phage vectors Agt10 and Agt11, and vector SV40 originating in animal vims, though any vector 
capable of replicating and amplifying in the host cells may be used therefor without restriction. Similarly the promoter 
and the terminator are not restricted in particular and any suitable combination may be selected therefor depending on 
toe host to be used, so long as they are adapted for the host employed in the expression of a DNA sequence coding for 
the tumor suppressor gene, or a fragment thereof. Any DNA may be used as the cDNA herein so long as it codes for 
the tumor suppressor gene product, or a fragment thereof. A chemically synthesized one may be used therefor When 
the protein to be expressed is one having a physiological activity of suppressing the proliferation of cancer cells then 
toe sequence of the cDNA is not restricted to the DNA sequence represented by the SEQ ID NO:1 but a DNA haung a 
DNA sequence which has undergone partial substitution, deletion, insertion or a combination thereof may be used 
therefor as the cDNA. 

™^ e f° mt ™ ant expression vector thus obtained is introduced into a host by. for example, the competent cell 
method [J. Mol. Biol.. 53. 154 (1970)]. the protoplast method [Proc. Natl. Acad. Sci. USA, 75, 1 929 (1978)]. thecalcium 
phosphate method [Science. 22L 551 (1983)]. the in vitro packaging method [Proc. Natl. Acad Sci USA. 72 581 
(1975)] or the virus vector method [Cell. 37. 1053 (1984)] to thereby prepare a transformant Escherichia coli Bacillus 
SffiM .yeasts and animal cells are usable as the host The transformant thus obtained is then cultured in an^appTopTT 
ate medium selected depending on the employed host The culture is usually effected at a temperature of from 20 to 
45 C wrthm a pH range of from 5 to 8 and. if necessary, under aeration and/or stirring. The tumor suppressor gene prod- 
uct or a fragment thereof may be separated and purified from the culture by appropriately combining known separa- 
tion/isolation methods Examples of these methods include salting out. solvent preaprration. dialysis, gel filtration 
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electrophoresis, ion exchange chromatography, affinity chromatography and reversed phase high performance liquid 
chromatography. 

. (4) Preparation of antibody 

By using the tumor suppressor gene product or a fragment thereof as an antigen, an antibody is prepared. A poly- 
clonal antibody is prepared in accordance with a conventional method by. for example, sufficiently immunizing an ani- 
mal such as mouse, guinea pig and rabbit with the antigen by subcutaneously, intramuscularly, intraperitoneally or 
intravenously administering it a number of times, sampling the blood from the animal and then separating the serum to 
obtain the antibody. A commercially available adjuvant is also usable therefor. 

A monoclonal antibody can be prepared by a known method. For example, spleen ceils of a mouse immunized with 
the antigen described above are fused with commercially available mouse myeloma cells to thereby give hybridomas. 
Then the target monoclonal antibody can be prepared from the culture supernatant of the hybridoma or the ascites fluid 
of a mouse inoculated with the hybridoma. 

It is not necessary that the tumor suppressor gene product to be used as the antigen has the whole amino acid 
structure but a peptide having a partial structure thereof, a modified peptide, its derivative or a fused peptide formed by 
fusing this peptide with another peptide are also usable. These substances may be prepared by any of the biological 
technique and chemical synthesis technique. 

These antibodies enable the identification and determination of the peptide of the present invention in human bio- 
logical samples and thus are applicable to. for example, diagnostic drugs for diseases to which the polypeptide is 
related. The peptide can be immunologically assayed in accordance with any of the known methods including the fluo- 
rescent antibody method, the passive agglutination method and the enzyme-labeled antibody technique. 

(5) Gene analysis of human organic tissues 

Examples of the biological sample usable in the gene analysis include normal human tissues, various types of 
human tumor tissues, human Wood, human bodily fluids and human secretions. The DNA of the employed tissue may 
be extracted and prepared by, for example, the method reported by Sato, T, et al. [Cancer Res., 5Q, 7184 (1990)]. 

From the DNA sequence provided by the present invention, a part DNA sequence at an appropriate position is 
selected and a synthetic oligonucleotide having this sequence or one complementary thereto is used as a primer, a 
probe or a marker. Thus the occurrence of a mutation of this gene in man and the morphology of the mutation can be 
analyzed. Furthermore, alterations (insertion, deletion, etc.) of this gene in a sample can also be detected by these 
analyses. 

The part DNA sequence may be selected from any part of the DNA sequence of the above-mentioned gene. It is 
needless to say that an artificially modified DNA sequence may be used therefor and thus the corresponding gene 
mutation can be detected. 

The analysis may be effected by, for example, the following method. Namely, primers of two sequences are 
selected and the partial sequence between them is amplified by the PCR method. Then the amplified DNA sequence 
is directly analyzed. Alternatively, this amplification product is integrated into a plasmid in the same manner as that of 
the above-mentioned case and host cells are transformed thereby. After culturing the transformant thus obtained, the 
DNA sequence of the clone thus obtained is analyzed. Further, the ligase chain reaction method may be applied to the 
amplification [Wu et al., Genomics. 4, 560 - 569 (1989)]. Furthermore, a specific mutation in the above-mentioned gene 
in a sample can be detected by using the allele-specific PCR [Ruano and KkJd. Nucleic Acid Research, 17, 8392 
(1989)] or the ARMS method [C.R. Newton et al., Nucleic Acid Research, 17, 2503 - 2517 (1989)]. 

Similarly, a point mutation can be detected by the SSCP method [Orita et al., Proc. Natl. Acad. Sic. USA, 3£. 2766 
- 2770 (1989); and Genomics, 5. 874 - 879 (1989)] or the RNase-protection method with the use of probes containing 
the DNA sequence thus selected or an RNA sequence originating therein. By using these probes, a mutation in the 
above-mentioned gene in a sample can be detected by the Southern hybridization method or an abnormality in the 
expression level of this gene in a sample can be examined by the Northern hybridization method. 

Escherichia cqH DH5aF/pAB1, pFL2 and pCE9 each carrying a plasmid containing the cDNA of this tumor sup- 
pressor gene were deposited with National Institute of Bioscience and Human-Technology, Agency of Industrial Sci- . 
ence and Technology, Ministry of International Trade and Industry under accession numbers FERM P-14127, 14128 
and 14129, respectively, on February 8, 1 994, and they were changed to Internationa] deposition under accession num- 
bers FERM BP-4923, 4924 and 4925, respectively, on December 9, 1994. 

The DNA of the present invention has a structure homologous with those of transcriptional factors, and originates 
in the most restricted commonly deleted region on chromosome 11 in MEN 1 -associated tumors. Therefore, it is 
expected that the DNA of the present invention may be a novel tumor suppressor gene. The DNA may be used as a tool 
in a gene therapy. Further, the fragment of the DNA may be used in the gene analysis of the DNA and in the diagnosis 
of diseases to which the DNA relates. 
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The polypeptide coded to by the DMA according to the present invention may be used as a reagent tor investiaa- 
tiore and used for preparing an antibody. The antibody may be used in the qualitative or quantitative analysis of me 
polypeptide in a biological sample. Thus, it is expected that the antibody may be useful as a novel diagnostic drug. 

Brief Description of the Drawings 

«. ^ a *^ a 9 ram showing the restriction of the region in which the MEN1 gene exists by the linkage analysis and 

tne LOH analysis. 

Fig. 2 is a diagram showing cDNA clones which overlap one another and the domain structure of ZFM1 cDNA 
1 derived therefrom. 

Rg. 3 is a diagram showing the homology of the 2FM1 protein with WT1 or EGR2. 

Rg. 4 is a diagram showing the constitution of exons of the ZFM1 gene. The exons are represented by 1 to 14 The 
domains observed in cDNA are represented by A to H. ' 

Examples 

To further illustrate the present invention in greater detail and in particular, the following Examples will be oiven 
However it is to be understood that the present invention is not restricted these Examples only. 

Example 1 1solation and Dnkage analysis of cosmid clones specific for chromosome 1 1 

At the early stage of studies, it was reported based on the linkage with a PYGM (muscle glycogen phosphorylase 
gene) marker that a gene participating in the onset of MENI existed in the long arm of chromosome 1 1 [Larrson et a! 
Nature. 332, 85 - 87 (1988)]. Subsequently, it was reported that it existed in a region of 12cM located between D1 1S149 
maker and INT2 marker of 11ql3 [Nakamura et at, Am. J. Hum. Genet. 44. 751 - 755 (1989)]. We prepared a cosmid 
library from a Chinese hamster/human hybrid ceU line containing a single human chromosome 1 1 and screened cosmid 
clones containing a part of the human chromosomal DNA with the use of a whole human DNA as probe fToWno et al 
Am. J. Hum. Genet.. 43. 258 - 268 (1991); and Tanigami et al.. Am. J. Hum. Genet. 50, 56 - 64 (1992)] Then these* 
dones were tested by hybridization with a hybrid cell fine panel containing a part of human chromosome 1 1 [Tarfgami 

tl L . ™ enet> 31 56 " 64 (1992)J and were "^P" 1 on *» chromosome through the fluorescent in situ 
hybridization (FISH) method [Hori et al.. Genomics. 13. 129 - 133 (1992)]. By effecting the linkage analysis with the use 

^e!?^* y HnP °° M be de,ected> the ,ocation of MEN1 Bene was restricted to a region of 
8cM between D11S480(cCI11-319) and D11S546 (cCI1 1-363) onq13 of chromosome 1 1 [Fujimori etal.. Am. J^um 
Genet, 50. 399 - 403 (1992)] (see Rg. 1). 

Example 2 Preparation of deletion map of chromosome 1 1 in MEN1-associated tumors 

On the other hand, investigations on the loss of heterozygosity (LOH) of the chromosome 1 1 in M EN 1 -associated 
tumors have also suggested that the tumor suppressor gene exists in the above-mentioned region [Friedman et al N 
Engl. J. Med 321 213 - 218 (1989); Thakker et al.. N. Engl. J. Med.. 321 218 - 224 (1989); and Bale et al.. Cancer 

hT*:5; ' i 1 f «P w^; * *** teen <Urther t***"* 0Ut * *• °» region on chromosome 1 1q 

mthesetumors that the MENI gene exists in the telomere side of PYGM [Bystroen et al.. Proa Nat Acad Sci USA. 
8Z. 1968 -1972 (1990)]. The results of the examination on LOH are arranged together with the results of the linkage 
analysis and it is thus considered that the MEN1 gene exists in a region of about 3cM between PYGM and D1 1S546 

(S66 1). 

Example 3 Preparation of physical map of 1 1q1 3 region 

DNaV^^T" rrUf T ^ 8 ' esl " c6on enz ^ ***** * ^e breakage point After separating 
DNA fragments by the pulse field gel electrophoresis, the Southern blotting analysis was carried out by using more than 
50 cosmd clones exeting in 1 1q13 as probes. Thus, the relationship in locations among the cosmid clones has been 
Sryj t 6P ! UP ° n the Capabi ' ity * eaCh done of being hybridized with a common genomic DNA fragment. As a 

hybrKfized wrth genomic DNA fragments relating to one another and thus they are located within a range of about 1 Mb 
mthe telomere side of PYGM [Tanigami et al.. Genomics. 13. 21 - 24 (1992)]. It has been suggested that PYGM and 
CCI1 11-4 among these cosmid dones. are markers dosest to the MEN1 gene (lod values: 5.03 and 5.13) [Fujimori et 
al. Am. J. Hum. Genet. 50, 399 - 403 (1992)]. Based on the resutts of the mapping of the breakage points with restric- 
tion enzymes ,n YAC dones 1908F2 and 199A7 isolated by using PYGM as a probe, it has been clarified that cCI11- 
367. among the 6 cosmid dones as described above, is also dose to PYGM. 
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Example 4 Isolation of exon sequence from 1 1ql3 region 

As described above, cCI11-4 and cCI1 1-367 are cosmid clones which are closest to the MEN1 gene. Thus, an 
attempt was made to isolate exons from these 2 cosmid clones by the exon trapping method [BucWer, A., et al., Proc. 
Natl. Acad. Sd. USA, 4005 • 4009 (1991)]. The cosmid DNA was cleaved with Bgill or BamHI, or both J these 
enzymes, and the fragment thus obtained was finked to the BamHI site of an exon splicing vector pSPLI . Transection 
into COS-7 cells and isolation of exon sequences by the reverse transcription PCR (RT-PCR) were effected each in 
accordance with the procedure described in the original paper. Consequently, 3 exon sequences originating in cCI1 1- 
367 were obtained and named respectively S367EI, s367E2 and S367E4. These exon sequences were respectively in 
sizes of 147 bp, 76 bp and 129 bp. 

Example 5 Isolation of full-length cDNA 

By using S367E4 (i.e., one of the exon sequences obtained in the above Example 4) as probe, a human cortical 
cDNA library was screened. Thus, a clone AB1 carrying a cDNA insert of 1 kb was obtained. With the use of this done 
AB1 as a probe, further, a cDNA clone FL2 was obtained from a human fetal liver cDNA library while cDNA clones CE5, 
CE9 and CE16 were obtained from a human cerebellar cDNA library. Then, it was confirmed that each of these clones 
could be hybridized with the original cosmid clone cCI1 1-367 and mapped on the chromosome 1 1q13 with a hybrid cell 
line panel. A sequence constructed by overlapping these cDNA clones one another at the common parts corresponded 
to ZFM1 cDNAof 3200 bp (SEQ ID NO:1). This ZFM1 cDNA contained an open reading frame (ORF) of 1869 bp which 
corresponded to a sequence of base Nos. 383 to 2251 in SEQ ID NO:1 . Based on the information as will be described 
hereinbelow, it has been proved that the sequence of SEQ ID NO: 1 and that of each clone can be regarded as being 
composed of 6 domains A (base Nos. 1 to 413 in SEQ ID NO:1), B (base Nos. 414 to 542 in SEQ ID NO:1), C (base 
Nos. 543 to 618 in SEQ ID NO:1), D (base Nos. 619 to 1964 in SEQ ID NO:1). E (base Nos. 1965 to 2218 in SEQ ID 
NO:1) and F (base Nos. 2219 to 3200 in SEQ ID NO:1) and domains G and H which are completely different therefrom. 
Namely, the exon sequences S367E2 and s367E4 obtained in the above Example 4 corresponded respectively to the 
domains C and B. The cDNA done CE5 lacked in a domain E consisting of 254 base pairs corresponding to a sequence 
of base Nos. 1965 to 2218 in SEQ ID NO:1, which may be due to an alternative splicing. The cDNA clone AB1 con- 
tained domains A and B and the different one G but not the domains C, D, E and F. The cDNA clone CE16 consisted of 
the domains D and E and the different one H (see Fig. 2). 

Example 6 Characteristics of the structure of protein coded for by the tumor suppressor gene 

A protein coded for by ZFM1 cDNA consisted of 623 amino add residues and had a nuclear localizing signal con- 
taining basic amino acids in the N-terminal side. Further, a sequence C-X2-C-X4-H-X4-C (amino add Nos. 279 - 292) 
had characteristics of a zinc finger motif existing in a DNA binding protein. 118 proline residues were contained in this 
ZFM1 protein. In particular, 69 prdine residues were contained in a region of amino acid Nos. 420 to 623 thereof. The 
sequence of this region showed high homologies with Wilms' tumor suppressor gene product (WT1) [Gessler et al., 
Nature, 342, 774 - 778 (1990)] and early growth response 2 (EGR2) protein as a transcriptional factor (27.3% and 
24.0%, respectively) (see Fig. 3). WT1 is a transcription factor having a Kruppel-like zinc finger motif [Rosenberg et al.. 
Nature, 319, 336 - 339 (1986)]. EGR2 is a human homologue of an early growth response gene Krox-20 [Chavier et a!.. 
EMBO J. Z 29 - 35 (1 988)] which is expressed at the G0-G1 junction in the cell cyde of cultured mouse cells and it is 
also a transcriptional factor [Joseph et al., Proc. Natl. Acad. Sci. USA, 85, 7164 - 7168 (1988)]. The ZFM1 protein fur- 
ther had 7 proline repetitive sequences (each consisting of at least 4 proline residues located continuously) in the C- 
terminal side. One of these repetitive sequences followed a glutamine repetitive sequence and thus formed a structure 
which was almost the same as that of the hinge domain of a mineralocorticoid receptor [Arriza et al.. Science, 22L 268 
- 275 (1987)]. Such a hinge structure is essentially required in the communication between a hormone binding domain 
and a DNA binding domain [Krust et al., EMBO J., 5, 891 - 897 (1986); and Giguere et al., Cell. 46, 645 - 652 (1 986)]. 
Further, mRNAs of a number of types originating in the ZFM1 gene were expressed in hormone-produdng organs such 
as pancreas, thyroid, adrenal gland and ovarium (see Table 1 in Example 8) 

These facts indicate that the ZFM1 protein is a tumor suppressor gene which is localized in the nuclei and exerts 
its function by binding to DNA and thus suppressing the proliferation of cells and that ZFM1 is a gene which participates 
in the onset of MEN1. 

Example 7 Structure of genomic gene 

Based on the cosmid done containing the ZFM1 gene, the genomic structure of the ZFM1 gene was determined. 
The ZFM1 gene existed over a region of about 20 kb in the genomic DNA and consisted of 14 exons (see Rg. 4). As 
Fig. 4 shows, it has been revealed that these exons (Nos. 1 to 1 4) and the domains A to H described in the above Exam- 
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5 6 6 tT| « a ^ S c <,0TOin A = ^ n 1 - ^ 8 = 2- domain C = exon 3. domain D = exon 4 

pnM ^ seq ^ e 01 SEQ 10 NO: 1 contains all of these 14 exons except the exons 2a and 3a. The seouence of m. 

On S J"? d ° ma Si ) - F ' adS h 8,6 ^Econesponolngtotheex^aTap^SSe 6 
exon 14. On fte other hand, the domain G of the cONA done AB1 consisting of the domains A-EH3 is cod«ifer bv Z 
«°n2awh,ch directly fellows the «on 2 coding for the domain B. Also, the domain 

Jj-fj dom« H-D-E is coded for by the exon 3a whi* is located «ety biVSe VSl ^TZ 
Example 8 Expression of 2FM1 gene in human tissues 

hlofH^S l ^ 1 i nSert0, |l h i CDNAC,0neFL2 35 aprobe ' mR NAs of various tissues were analyzed by the Northern 
Wottng method. As a result, the expressions of ZFM1 mRNAs of 3.3 kb and 2.7 kb were observedin 7tt 3 322T 

the smaller mRNA corresponds to one containing the domain H instead of the domains A-B-C (see Ro » To raS 

ISTE - .*' ^ ^ in 9rea,er defan ' ,he rWeree ^P 80 " PCR (m-PC^aSysKs IffSX 
22? RNAs from vanous human tissues and using primer sets (see the arrow heads in FtoVZS^Sl 
reject™ domains. As a result, the expressions of ZFM1 mRNAs of various types, which were moI»mTh» ?J£ 

A-B-C-D. A-B-G and H-D (see Fig. 2) were observed in nearly all tissues, though the exoression viek* 

s tsrr^ - " " ' mRNA ha ** me — e - ~ 
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SEQUENCE LISTING 
(2) INFORMATION FOR SEQ ID NO: 1: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(ix) FEATURE: 

(A) FEATURE KEY: 5'UTR 

(B) LOCATION: 1. .382 

(A) FEATURE KEY: CDS 

(B) LOCATION: 38 3.. 2254 

(A) FEATURE KEY: exon 1 

(B) LOCATION: 1. .413 

(A) FEATURE KEY: exon 2 

(B) LOCATION: 414.. 542 

(A) FEATURE KEY: exon 3 

(B) LOCATION: 543. .618 

(A) FEATURE KEY: exon 4 

(B) LOCATION: 619. .771 

(A) FEATURE KEY: exon 5 

(B) LOCATION: 772.. 861 
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(A) 



FEATURE KEY: exon 6 



5 



(B) 



LOCATION: 862.. 1045 



(A) 



FEATURE KEY: exon 7 



(B) 



LOCATION: 1046. .1161 



10 



(A) FEATURE KEY: exon 8 

(B) LOCATION: 1162.. 1269 

(A) FEATURE KEY: exon 9 

(B) LOCATION: 1270.. 1450 
(A) FEATURE KEY: exon 10 



(B) LOCATION: 1451.. 1724 

(A) FEATURE KEY: exon 11 

(B) LOCATION: 1725.. 1784 

(A) FEATURE KEY: exon 12 

(B) LOCATION: 1785.. 1964 

(A) FEATURE KEY: exon 13 

(B) LOCATION: 1965.. 2137 

(A) FEATURE KEY: exon 14 

(B) LOCATION: 2138.. 3132 

(A) FEATURE KEY: 3'UTR 

(B) LOCATION: 2280.. 3200 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



CGTTGCTGTC GAAATGAAGT GCGCGCTGCG ACACCTCCCA GCCCACCGAA CTCCGCCGCC 60 

ATTTCCTCGC TTGCCTAACG GTTCGGCCAA TCCCAGCGCG CATCAATGCC GGACTGAGGC 120 

TCCGCCAATC GGAGGCCGCC GATTTCGACC CTTCGCCTCG GCCCGGCCCA ATCCATTCCC 180 

CGGCCCCGCC GCCCCCGGCC CGCCCCCGCG GTGCCCTCTC TCCTCCCTCT TTGTGCGTCT 240 



20 



45 



55 
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CGCGCCGCCG CCGCCCGCCG CGTGAGAGGA CGGGCTCCGC GCGCTCCGGC AGCGCATTCG 300 
GGTCCCCTCC CCCCGGGAGG CTTGCGAAGG AGAAGCCGCC GCAGAGGAAA AGCAGGTGCC 360 
GGTGCCTGTC CCCGGGGGCG CC ATG GCG ACC GGA GCG AAC GCC ACG CCG TTG 412 

Met Ala Thr Gly Ala Asn Ala Thr Pro Leu 
15 io 
GAC TTC CCA AGT AAG AAG CGG AAG AGG AGC CGC TGG AAC CAA GAC ACA 460 
Asp Phe Pro Ser Lys Lys Arg Lys Arg Ser Arg Trp Asn Gin Asp Thr 

15 20 25 

ATG GAA CAG CCG ACA GTG ATT CCA GGA ATG CCT ACA GTT ATT CCC CCT 508 
Met Glu Gin Pro Thr Val lie Pro Gly Met Pro Thr Val He Pro Pro 

30 35 40 

GGA CTT ACT CGA GAA CAA GAA AGA GCT TAT ATA GTG CAA CTG CAG ATA 556 
Gly Leu Thr Arg Glu Gin Glu Arg Ala Tyr lie Val Gin Leu Gin He 

45 SO 55 

GAA GAC CTG ACT CGT AAA CTG CGC ACA GGG GAC CTG GGC ATC CCC CCT 604 
Glu Asp Leu Thr Arg Lys Leu Arg Thr Gly Asp Leu Gly He Pro Pro 

60 65 70 

AAC CCT GAG GAC AGG TCC CCT TCC CCT GAG CCC ATC TAC AAT AGC GAG 652 
Asn Pro Glu Asp Arg Ser Pro Ser Pro Glu Pro He Tyr Asn Ser Glu 
75 80 85 90 

GGG AAG CGG CTT AAC ACC CGA GAG TTC CGC ACC CGC AAA AAG CTG GAA 700 
Gly Lys Arg Leu Asn Thr Arg Glu Phe Arg Thr Arg Lys Lys Leu Glu 
95 ioo 105 
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15 



20 



25 



30 



35 



40 



45 



50 



GAG GAG 
Glu Glu 

TTC AAG 
Phe Lys 

AAA GTC 
Lys Val 

140 
CTG CTC 
Leu Leu 
155 

TGC AAT 
Cys Asn 

AAG GTT 
Lys Val 

CTT CAT 
Leu His 

GTG GAA 
Val Glu 
220 



CGG CAC 
Arg: His 
110 
CCA CCT 
Pro Pro 
125 

ATG ATT 
Met lie 

ATC GGG 
He Gly 

GCC AAG 
Ala Lys 

GGG CGC 
Gly Arg 
190 
GCC CTG 
Ala Leu 
205 

CAG ATA 
Gin He 



AAC CTC ATC 
Asn Leu He 

GCA GAT TAC 
Ala Asp Tyr 

CCA CAA GAT 
Pro Gin Asp 
145 

CCC AGA GGG 
Pro Arg Gly 

160 
ATT ATG ATC 
He Met lie 
175 

AAA GAT GGC 
Lys Asp Gly 

GTT ACT GCC 
Val Thr Ala 

AGA AAC ATC 
Arg Asn He 
225 



ACA GAG 
Thr Glu 
115 
AAA CCT 
Lys Pro 
130 

GAG TAC 
Glu Tyr 

AAC ACC 
Asn Thr 

CGG GGG 
Arg Gly 

CAG ATG 
Gin Met 
195 
AAT ACA 
Asn Thr 
210 

CTG AAG 
Leu Lys 



ATG GTT 
Met Val 

CCA GCA 
Pro Ala 

CCA GAA 
Pro Glu 

CTG AAG 
Leu Lys 
165 
AAA GGG 
Lys Gly 
180 

TTG CCA 
Leu Pro 

ATG GAG 
Met Glu 

CAG GGT 
Gin Gly 



GCA CTC AAT 
Ala Leu Asn 
120 

ACA CGT GTG 
Thr Arg Val 
135 

ATC AAC TTT 
He Asn Phe 
150 

AAC ATA GAG 
Asn He Glu 

TCT GTG AAA 
Ser Val Lys 

GGA GAA GAT 
Gly Glu Asp 
200 

AAC GTC AAA 
Asn Val Lys 

215 
ATC GAG ACT 
He Glu Thr 
230 



CCG GAT 748 
Pro Asp 

AGT GAT 796 
Ser Asp 

GTG GGG 844 
Val Gly 



AAG GAG 
Lys Glu 
170 
GAA GGG 
Glu Gly 
185 

GAG CCA 
Glu Pro 



892 



940 



988 



AAG GCA 1036 
Lys Ala 

CCA GAG 1084 
Pro Glu 
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GAC CAG AAT GAT CTA CGG AAG ATG CAG CTT CGG GAG TTG GCT CGC TTA 1132 
6 Asp Gin Asn Asp Leu Arg Lys Met Gin Leu Arg Glu Leu Ala Arg Leu 

235 240 245 250 

w AAT GGG ACC CTT CGG GAA GAC GAT AAC AGG ATC TTA AGA CCC TGG CAG 1180 

Asn Gly Thr Leu Arg Glu Asp Asp Asn Arg He Leu Arg Pro Trp Gin 
255 260 265 

AGC TCA GGG ACC CGC AGC ATT ACC AAC ACC ACA GTG TGT ACC AAG TGT 1228 

Ser Ser Gly Thr Arg Ser He Thr Asn Thr Thr Val Cys Thr Lys Cys 
20 270 275 280 

GGA GGG GCT GGC CAC ATT GCT TCA GAC TGT AAA TTC CAA AGG CCT GGT 1276 

Gly Gly Ala Gly His He Ala Ser Asp Cys Lys Phe Gin Arg Pro Gly 

25 

285 290 295 

GAT CCT CAG TCA GCT CAG GAT AAA GCA CGG ATG GAT AAA GAA TAT TTG 1324 
30 Asp Pro Gin Ser Ala Gin Asp Lys Ala Arg Met Asp Lys Glu Tyr Leu 
300 305 310 

TCC CTC ATG GCT GAA CTG GGT GAA GCA CCT GTC CCA GCA TCT GTG GGC 1372 

35 

Ser Leu Met Ala Glu Leu Gly Glu Ala Pro Val Pro Ala Ser Val Gly 
315 320 325 330 

* TCC ACC TCT GGG CCT GCC ACC ACA CCC CTG GCC AGC GCA CCT CGT CCT 1420 

Ser Thr Ser Gly Pro Ala Thr Thr Pro Leu Ala Ser Ala Pro Arg Pro 

„ 335 340 345 

GCT GCT CCC GCC AAC AAC CCA CCT CCA CCG TCT CTC ATG TCT ACC ACC 1468 
Ala Ala Pro Ala Asn Asn Pro Pro Pro Pro Ser Leu Met Ser Thr Thr 
350 355 360 
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w 



15 



20 



30 



35 



40 



45 



50 



CAG AGC CGC 
Gin Ser Arg 
365 

TAC CAC GGC 
Tyr His Gly 

380 
CAC AGC TTC 
His Ser Phe 
395 

CAT CCC ATG 
His Pro Met 

CCA CCA CCA 
Pro Pro Pro 

GGC CCT CCT 
Gly Pro Pro 
445 

GGG GTC TAT 
Gly Val Tyr 

460 
ATG GGC ATG 
Met Gly Met 
475 



CCA CCC TGG 
Pro Pro Trp 

ATG CAT GGA 
Met His Gly 

CCA CAC CCA 
Pro His Pro 
400 

CAG CAC AAC 
Gin His Asn 

415 
CCA CCG ATG 
Pro Pro Met 
430 

CCA ATG GAT 
Pro Met Asp 

CGC CTG CAT 
Arg Leu His 

ATG CCG CCG 
Met Pro Pro 
480 



ATG AAT 
Met Asn 
370 
GGT GGT 
Gly Gly 
385 

TTA CCC 
Leu Pro 

CCC AAT 
Pro Asn 

AAC CAG 
Asn Gin 

CAG TAC 
Gin Tyr 
450 
CAA GGA 
Gin Gly 
465 

CCG CCG 
Pro Pro 



TCT GGT 
Ser Gly 

CCT GGT 
Pro Gly 

AGC CTG 
Ser Leu 

GGA CCC 
Gly Pro 
420 
GGC CCC 
Gly Pro 
435 

CTG GGA 
Leu Gly 

AAA GGT 
Lys Gly 

CCG CCT 
Pro Pro 



CCT TCA 
Pro Ser 

GGG CCC 
Gly Pro 
390 
ACA GGT 
Thr Gly 
405 

CCA CCC 
Pro Pro 

CAC CCT 
His Pro 

AGT ACG 
Ser Thr 

ATG ATG 
Met Met 
470 
CCC AGT 
Pro Ser 
485 



GAG AGT TGG CCC 
Glu Ser Trp Pro 
375 

GGA GGT GGC CCC 
Gly Gly Gly Pro 



1516 



GGG CAT 
Gly His 

CCT TGG 
Pro Trp 

CCT GGG 
Pro Gly 
440 
CCT GTG 
Pro Val 
455 

CCG CCA 
Pro Pro 



GGT GGA 
Gly Gly 
410 
ATG CAG 
Met Gin 
425 

CAC CAT 
His His 

GGC TCT 
Gly Ser 

CCA CCT 
Pro Pro 



1564 



1612 



1660 



1708 



1756 



1804 



GGG CAG CCC CCA 
Gly Gin Pro Pro 
490 



1852 



55 
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CCC CCT CCC TCT GGT CCT CTT CCC CCA TGG CAA CAA CAG CAG CAG CAG 1900 
Pro Pro Pro Ser Gly Pro Leu Pro Pro Trp Gin Gin Gin Gin Gin Gin 

495 500 505 

CCT CCG CCA CCC CCT CCG CCC AGC AGC AGT ATG GCT TCC AGT ACC CCC 1948 
Pro Pro Pro Pro Pro Pro Pro Ser Ser Ser Met Ala Ser Ser Thr Pro 

510 515 520 

TTG CCA TGG CAG CAA AAT ACG ACG ACT ACC ACC ACG AGC GCT GGC ACA 1996 
Leu Pro Trp Gin Gin Asn Thr Thr Thr Thr Thr Thr Ser Ala Gly Thr 

525 530 535 

GGG TCC ATC CCG CCA TGG CAA CAG CAG CAG GCG GCT GCC GCA GCT TCT 2044 
Gly Ser He Pro Pro Trp Gin Gin Gin Gin Ala Ala Ala Ala Ala Ser 

540 545 550 

CCA GGA GCC CCT CAG ATG CAA GGC AAC CCC ACT ATG GTG CCC CTG CCC 2092 
Pro Gly Ala Pro Gin Met Gin Gly Asn Pro Thr Met Val Pro Leu Pro 
555 560 565 570 

CCC GGG GTC CAG CCG CCT CTG CCG CCT GGG GCC CCT CCC CCT CCG CCC 2140 
Pro Gly Val Gin Pro Pro Leu Pro Pro Gly Ala Pro Pro Pro Pro Pro 

575 580 585 

CGT. AGC ATC GAG TGT CTT CTT TGT CTT CTT TCT CTC CTC ACC CAA CTC 2188 
Arg Ser He Glu Cys Leu Leu Cys Leu Leu Ser Leu Leu Thr Gin Leu 

590 595 600 

CCT TTG CCT CTC CCC AAA CCG GGC CGC CAG GAT CCC TCC CCG CGG CGG 2236 
Pro Leu Pro Leu Pro Lys Pro Gly Arg Gin Asp Pro Ser Pro Arg Arg 
605 610 615 
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CGA TGG CCC GAG CCA TGAGAGTGAG GACTTTCCGC GCCCATTGGT GACCCTTCCA 2291 
Arg Trp Pro Glu Pro 
620 623 

GGCAGACAGC CTCAGCAACG CCCCTGGTGG ACAGGATGGT TCGGCAAAGC AGCCTGAGTT 2351 
ATTTTTGTGG ACGGAATCGG AACACGCTGG CTCCATATCG TGAAATTTTT ATTAATTTTT 2411 
TTCTTTTTCC TTTGTTACTT CTTTATCTTT TCCTTTCTTC AGACTCCGTC CAAGGAGATG 2471 
CTCTCCCCGG TCTTCTGCTG CAATTTAGAT TCCTTTGGGT TCTCTCCAGT TCTCCTTCCC 2531 
TTACCAAGGA GAGGGGAGCA AATGGTTTTG GGCAAGGGCT TTGGCCATTC ATGTCAAGCT 2591 
GGTTGTGGGT TTTTCAAGGT GCCATAGCCA CCCCCAAATA TGTTTGTTTA AAGCGTGGGG 2651 
TTTTTTAATC TCTGCCACCC TTGTCAAGGG AGTCTTGTAA AGTTGCCGAG GGTAGGTTCA 2711 
TCTCCAGGTT TCGGGATTCC CATCCGTCCT GGCGATCCTG CCAGCAGTGG GTGGGCAGCC 2771 
TGAGCTCCCT CGGGCTCGCC TGCCAGCCTG GAGTTCTTCC TGTGCTCCTT GATCACCTGA 2831 
GCTGCCTCAG ATTCCATTTG GTCCTCTCCT TCCTGGAAGG CTTCCTTTTA TGTTTTGTTT 2891 
TAATCCCAAA TGTCTGAATG TTTTGCAGTG TGTAGGGGTT TGAGCCCCTT GTTCATTCTC 2951 
CTTCCTTTTT CCTCCCGCTT CCCTCTCCAT GAAGTGATTC TGTTGACAAT AATGTATACT 3011 
GCGCGTTCTC TTCACTGGTT , TATCTGCAGA , AATTTCTCTG GGCTTTTTTC GGTGTTAGAT 3071 
TCAACACTGC GCTAAAGCGG , GGATGTTCCA , TTGAATAAAA GAGCAGTGTG GTTTTCTGGG 3131 
AAAAAAAAAA AAAAAAAAAA , AAAAAAAAAA , AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 3191 
AAAAAAAAA 3200 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 

(i) APPLICANT: 

(A) NAME: Cancer Institute 

(B) STREET: 37-1, Kamiikebukuro 1-chome, 

(C) CITY: Toshima-ku, Tokyo 

(E) COUNTRY: Japan 

(F) POSTAL CODE (ZIP) : none 

(A) NAME: Eisai Co., Ltd. 

(B) STREET: 6-10, Koishikawa 4-chome, 

(C) CITY: Bunkyo-ku, Tokyo 

(E) COUNTRY: JAPAN 

(F) POSTAL CODE (ZIP) : 112 

(ii) TITLE OF INVENTION: Tumor Suppressor Gene 
(iii) NUMBER OF SEQUENCES: 2 

<iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: EP 95101980.1 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION:!.. 3 82 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 383. .2254 



40 (ix) FEATURE: 

(A) NAME/ KEY: exon 1 

(B) LOCATION:1. .413 

(ix) FEATURE: 

(A) NAME/ KEY: exon 2 
^ (B) LOCATION: 414. .542 

(ix) FEATURE: 

(A) NAME/KEY: exon 3 

(B) LOCATION: 54 3. .618 

(ix) FEATURE: 
SO (A) NAME/ KEY: exon 4 

(B) LOCATION: 619. .771 
(ix) FEATURE: 

(A) NAME/KEY: exon 5 

(B) LOCATION: 772.. 861 



55 
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(ix) FEATURE: 

(A) NAME/ KEY: exon 6 

(B) LOCATION: 862. .1045 

(ix) FEATURE: 

(A) NAME/ KEY : exon 7 

(B) LOCATION: 1046. .1161 

(ix) FEATURE: 

(A) NAME/KEY: exon 8 

(B) LOCATION: 1162. .1269 

(ix) FEATURE: 

(A) NAME/ KEY: exon 9 

(B) LOCATION: 1270.. 1450 

(ix) FEATURE: 

(A) NAME/ KEY: exon 10 

(B) LOCATION: 1451. .1724 

(ix) FEATURE: 

(A) NAME/ KEY: exon 11 

(B) LOCATION: 1725. .1784 

(ix) FEATURE: 

(A) NAME/ KEY: exon 12 

(B) LOCATION: 1785. .1964 

(ix) FEATURE: 

(A) NAME/ KEY: exon 13 

(B) LOCATION: 1965. .2137 

(ix) FEATURE: 

(A) NAME/KEY: exon 14 

(B) LOCATION:2138. .3132 

(ix) FEATURE: 

(A) NAME/ KEY: 3'UTR 

(B) LOCATION:2280. .3200 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
CGTTGCTGTC GAAATGAAGT GCGCGCTGCG ACACCTCCCA GCCCACCGAA CTCCGCCGCC 
ATTTCCTCGC TTGCCTAACG GTTCGGCCAA TCCCAGCGCG CATCAATGCC GGACTGAGGC 
TCCGCCAATC GGAGGCCGCC GATTTCGACC CTTCGCCTCG GCCCGGCCCA ATCCATTCCC 
CGGCCCCGCC GCCCCCGGCC CGCCCCCGCG GTGCCCTCTC TCCTCCCTCT TTGTGCGTCT 
CGCGCCGCCG CCGCCCGCCG CGTGAGAGGA CGGGCTCCGC GCSCTCCGGC AGCGCATTCG 
GGTCCCCTCC CCCCGGGAGG CTTGCGAAGG AGAAGCCGCC GCAGAGGAAA AGCAGGTGCC 
GGTGCCTGTC CCCGGGGGCG CC ATG GCG ACC GGA GCG AAC GCC ACG CCG TTG 



GAC TTC CCA AGT AAG AAG CGG AAG AGG AGC CGC TGG AAC CAA GAC ACA 
Asp Phe Pro Ser Lys Lys Arg Lys Arg Ser Arg Trp Asn Gin Asp Thr 



Met Ala 
1 



Thr Gly Ala Asn Ala Thr Pro Leu 
5 10 



15 



20 



25 
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10 



15 



30 



35 



50 



ATG GAA CAG CCG ACA GTG ATT CCA GGA ATG CCT ACA GTT ATT CCC CCT SOA 
Met GXu Gin Pro Thr Val lie Pro Gly Met Pro Thr Val lie Pro PrI 9 
30 35 40 

GGA CTT ACT CGA GAA CAA GAA AGA GCT TAT ATA GTG CAA CTG CAG ATA " 556 

Gly Leu Thr Arg Glu Gin Glu Arg Ala Tyr lie Val Gin Leu Gin lie 
45 50 55 

GAA GAC CTG ACT CGT AAA CTG CGC ACA GGG GAC CTG GGC ATC CCC CCT 
Glu Asp Leu Thr Arg Lys Leu Arg Thr Gly Asp Leu Gly He Pro Pro 
60 65 70 

AAC CCT GAG GAC AGG TCC CCT TCC CCT GAG CCC ATC TAC AAT AGC GAG 
Asn Pro Glu Asp Arg Ser Pro Ser Pro Glu Pro He Tyr Asn Ser Glu 
75 60 85 go 

GGG AAG CGG CTT AAC ACC CGA GAG TTC CGC ACC CGC AAA AAG CTG GAA 700 
Gly Lys Arg Leu Asn Thr Arg Glu Phe Arg Thr Arg Lys Lys Leu Glu 
95 100 105 



GAG GAG CGG CAC AAC CTC ATC ACA GAG ATG GTT GCA CTC AAT CCG GAT 
Glu Glu Arg His Asn Leu He Thr Glu Met Val Ala Leu Asn Pro Asp 
20 HO 115 120 



40 GTG GAA CAG ATA AGA AAC ATC CTG AAG CAG GGT ATC GAG ACT CCA GAG 

Val Glu Gin He Arg Asn He Leu Lys Gin Gly He Glu Thr Pro Glu 
220 225 230 



604 



652 



748 



TTC AAG CCA CCT GCA GAT TAC AAA CCT CCA GCA ACA CGT GTG AGT GAT 796 
Phe Lys Pro Pro Ala Asp Tyr Ljs Pro Pro Ala Thr Arg Val Ser Asp 
125 130 135 

AAA GTC ATG ATT CCA CAA GAT GAG TAC CCA GAA ATC AAC TTT GTG GGG 844 
Lys Val Met He Pro Gin Asp Glu Tyr Pro Glu He Asn Phe Val Glv 
140 145 150 

CTG CTC ATC GGG CCC AGA GGG AAC ACC CTG AAG AAC ATA GAG AAG GAG 892 
Leu Leu He Gly Pro Arg Gly Asn Thr Leu Lys Asn He Glu Lys Glu 
155 160 165 170 

TGC AAT GCC AAG ATT ATG ATC CGG GGG AAA GGG TCT GTG AAA GAA GGG 940 
Cys Asn Ala Lys He Met He Arg Gly Lys Gly Ser Val Lys Glu Gly 
175 180 185 

AAG GTT GGG CGC AAA GAT GGC CAG ATG TTG CCA GGA GAA GAT GAG CCA 988 
Lys Val Gly Arg Lys Asp Gly Gin Met Leu Pro Gly Glu Asp Glu Pro 
1^0 195 200 

CTT CAT GCC CTG GTT ACT GCC AAT ACA ATG GAG AAC GTC AAA AAG GCA 1036 
Leu His Ala Leu Val Thr Ala Asn Thr Met Glu Asn Val Lys Lys Ala 
205 210 215 - 



1084 



GAC CAG AAT GAT CTA CGG AAG ATG CAG CTT CGG GAG TTG GCT CGC TTA 1132 
Asp Gin Asn Asp Leu Arg Lys Met Gin Leu Arg Glu Leu Ala Arg Leu 
45 2 3 5 2 4 0 2 4 5 2 5 0 

AAT GGG ACC CTT CGG GAA GAC GAT AAC AGG ATC TTA AGA CCC TGG CAG 1180 
Asn Gly Thr Leu Arg Glu Asp Asp Asn Arg He Leu Arg Pro Trp Gin 
255 260 265 



AGC TCA GGG ACC CGC AGC ATT ACC AAC ACC ACA GTG TGT ACC AAG TGT 1228 
Ser Ser Gly Thr Arg Ser He Thr Asn Thr Thr Val Cys Thr Lys Cvs 
270 275 280 

GGA GGG GCT GGC CAC ATT GCT TCA GAC TGT AAA TTC CAA AGG CCT GGT 1276 
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Gly Gly Ala Gly His lie Ala Ser Asp Cys Lys Phe Gin Arq Pro Glvr 
285 290 295 

GAT CCT CAG TCA GCT CAG GAT AAA GCA CGG ATG GAT AAA GAA TAT TTG 
Asp Pro Gin Ser Ala Gin Asp Lys Ala Arg Met Asp Lys Glu Tyr Leu 
300 305 310 

TCC CTC ATG GCT GAA CTG GGT GAA GCA CCT GTC CCA GCA TCT GTG GGC 
Ser Leu Met Ala Glu Leu Gly Glu Ala Pro Val Pro Ala Ser Val Gly 
315 320 325 330 

TCC ACC TCT GGG CCT GCC ACC ACA CCC CTG GCC AGC GCA CCT CGT CCT 
Ser Thr Ser Gly Pro Ala Thr Thr Pro Leu Ala Ser Ala Pro Arg Pro 
335 340 345 

GCT GCT CCC GCC AAC AAC CCA CCT CCA CCG TCT CTC ATG TCT ACC ACC 
Ala Ala Pro Ala Asn Asn Pro Pro Pro Pro Ser Leu Met Ser Thr Thr 
350 355 360 

. CAG AGC CGC CCA CCC TGG ATG AAT TCT GGT CCT TCA GAG AGT TGG CCC 
Gin Ser Arg Pro Pro Trp Met Asn Ser Gly Pro Ser Glu Ser Trp Pro 
365 370 375 

TAC CAC GGC ATG CAT GGA GGT GGT CCT GGT GGG CCC GGA GGT GGC CCC 
Tyr His Gly Met His Gly Gly Gly Pro Gly Gly Pro Gly Gly Gly Pro 
380 385 * 390 

CAC AGC TTC CCA CAC CCA TTA CCC AGC CTG ACA GGT GGG CAT GGT GGA 
His Ser Phe Pro His Pro Leu Pro Ser Leu Thr Gly Gly His Gly Gly 
395 400 405 410 

CAT CCC ATG CAG CAC AAC CCC AAT GGA CCC CCA CCC CCT TGG ATG CAG 
His Pro Met Gin His Asn Pro Asn Gly Pro Pro Pro Pro Trp Met Gin 
415 420 425 

CCA CCA CCA CCA CCG ATG AAC CAG GGC CCC CAC CCT CCT GGG CAC -CAT 
Pro Pro Pro Pro Pro Met Asn Gin Gly Pro His Pro Pro Gly His His 
430 ' 435 440 

GGC CCT CCT CCA ATG GAT CAG TAC CTG GGA AGT ACG CCT GTG GGC TCT 
Gly Pro Pro Pro Met Asp Gin Tyr Leu Gly Ser Thr Pro Val Gly Ser 
445 450 455 

GGG GTC TAT CGC CTG CAT CAA GGA AAA GGT ATG ATG CCG CCA CCA CCT 
Gly Val Tyr Arg Leu His Gin Gly Lys Gly Met Met Pro Pro Pro Pro 
460 465 470 

ATG GGC ATG ATG CCG CCG CCG CCG CCG CCT CCC AGT GGG CAG CCC CCA 
Met Gly Met Met Pro Pro Pro Pro Pro Pro Pro Ser Gly Gin Pro Pro 
475 480 485 - 490 

CCC CCT CCC TCT GGT CCT CTT CCC CCA TGG CAA CAA CAG CAG CAG CAG 
Pro Pro Pro Ser Gly Pro Leu Pro Pro Trp Gin Gin Gin Gin Gin Gin 
495 500 505 

CCT CCG CCA CCC CCT CCG CCC AGC AGC AGT ATG GCT TCC AGT ACC CCC 
Pro Pro Pro Pro Pro Pro Pro Ser Ser Ser Met Ala Ser Ser Thr Pro 
510 515 520 

TTG CCA TGG CAG CAA AAT ACG ACG ACT ACC ACC ACG AGC GCT GGC ACA 
Leu Pro Trp Gin Gin Asn. Thr Thr Thr Thr Thr Thr Ser Ala Gly Thr 
525 530 535 

GGG TCC ATC CCG CCA TGG CAA CAG CAG CAG GCG GCT GCC GCA GCT TCT 
Gly Ser He Pro Pro Trp Gin Gin Gin Gin Ala Ala Ala Ala Ala Ser 
540 545 550 
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15 



20 



25 



30 



35 



40 



CCA GGA GCC CCT CAG ATG CAA GGC AAC CCC ACT AT3 GTG CCC CTG CCC 
Pro Gly Ala Pro Gin Met Gin Gly Asn Pro Thr Met Val Pro Leu Pro 
555 560 565 570 

CCC GGG GTC CAG CCG CCT CTG CCG CCT GGG GCC CCT CCC CCT CCG CCC 
Pro Gly Val Gin Pro Pro Leu Pro Pro Gly Ala Pro Pro Pro Pro Pro 
575 580 585 

CGT AGC ATC GAG TGT CTT CTT TGT CTT CTT TCT CTC CTC ACC CAA CTC 
Arg Ser lie Glu Cys Leu Leu Cys Leu Leu Ser Leu Leu Thr Gin Leu 
5 *0 595 600 

CCT TTG CCT CTC CCC AAA CCG GGC CGC CAG GAT CCC TCC CCG CGG CGG 
Pro Leu Pro Leu Pro Lys Pro Gly Arg Gin Asp Pro Ser Pro Arg Arg 
605 610 615 

CGA TGG CCC GAG CCA TGA GAGTGAGGAC TTTCCGCGCC CATTGGTGAC 
Arg Trp Pro Glu Pro 
620 

CCTTCCAGGC AGACAGCCTC AGCAACGCCC CTGGTGGACA GGATGGTTCG GCAAAGCAGC 
CTGAGTTATT TTTGTGGACG GAATCGGAAC ACGCTGGCTC CATATCGTGA AATTTTTATT 
AATTTTTTTC TTTTTCCTTT GTTACTTCTT TATCTTTTCC TTTCTTCAGA CTCCGTCCAA 
GGAGATGCTC TCCCCGGTCT TCTGCTGCAA TTTAGATTCC TTTGGGTTCT CTCCAGTTCT 
CCTTCCCTTA CCAAGGAGAG GGGAGCAAAT GGTTTTGGGC AAGGGCTTTG GCCATTCATG 
TCAAGCTGGT TGTGGGTTTT TCAAGGTGCC ATAGCCACCC CCAAATATGT . TTGTTTAAAG 
CGTGGGGTTT TTTAATCTCT GCCACCCTTG TCAAGGGAGT CTTGTAAAGT TGCCGAGGGT 
AGGTTCATCT CCAGGTTTCG GGATTCCCAT CCGTCCTGGC GATCCTGCCA GCAGTGGGTG 
GGCAGCCTGA GCTCCCTCGG GCTCGCCTGC CAGCCTGGAG TTCTTCCTGT GCTCCTTGAT 
CACCTGAGCT GCCTCAGATT CCATTTGGTC CTCTCCTTCC TGGAAGGCTT CCTTTTATGT 
TTTGTTTTAA TCCCAAATGT CTGAATGTTT TGCAGTGTGT AGGGGTTTGA GCCCCTTGTT 
CATTCTCCTT CCTTTTTCCT CCCGCTTCCC TCTCCATGAA GTGATTCTGT TGACAATAAT 
GTATACTGCG CGTTCTCTTC ACTGGTTTAT CTGCAGAAAT TTCTCTGGGC TTTTTTCGGT 
GTTAGATTCA ACACTGCGCT AAAGCGGGGA TGTTCCATTG AATAAAAGAG CAGTGTGGTT 
TTCTGGGAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAA 
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2236 
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2884 

2944 

3004 

3064 

3124 

3184 

3200 
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50 



(2) INFORMATION FOR. SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



55 



22 



EP 0 727 486 A2 



Met Ala Thr Gly Ala Asn Ala Thr Pro Leu Asp Phe Fro Sex Lys Lys 
15 10 15 

Arg Lys Arg Ser Arg Trp Asn Gin Asp Thr Met Glu Gin Pro Thr Val 
20 25 30 

He Pro Gly Met Pro Thr Val He Pro Pro Gly Leu Thr Arg Glu Gin 
35 40 45 

Glu Arg Ala Tyr He Val Gin Leu Gin He Glu Asp Leu Thr Arg Lys 
50 55 60 

Leu Arg Thr Gly Asp Leu Gly He Pro Pro Asn Pro Glu Asp Arg Ser 
65 70 75 80 

Pro Ser Pro Glu Pro He Tyr Asn Ser Glu Gly Lys Arg Leu Asn Thr 
85 90 95 

Arg Glu Phe Arg Thr Arg Lys Lys Leu Glu Glu Glu Arg His Asn Leu 
100 105 110 

He Thr Glu Met Val Ala Leu Asn Pro Asp Phe Lys Pro Pro Ala Asp 
115 120 125 

Tyr Lys Pro Pro Ala Thr Arg Val Ser Asp Lys Val Met He Pro Gin 
130 135 * 140 

Asp Glu Tyr Pro Glu He Asn Phe Val Gly Leu Leu He Gly Pro Arg 
145 150 155 160 

Gly Asn Thr Leu Lys Asn He Glu Lys Glu Cys Asn Ala Lys He Met 
165 170 * ~ 175 

He Arg Gly Lys Gly Ser Val Lys Glu Gly Lys Val Gly Arg Lys Asp 
180 185 190 

Gly Gin Met Leu Pro Gly Glu Asp Glu Pro Leu His Ala Leu Val Thr 
195 200 205 

Aia Asn Thr Met Glu Asn Val Lys Lys Ala Val Glu Gin He Arg Asn 
210 215 220 

He Leu Lys Gin Gly He Glii Thr Pro Glu Asp Gin Asn Asp Leu Arg 
225 230 235 240 

Lys Met Gin Leu Arg Glu Leu Ala Arg Leu Asn Gly Thr Leu Arg Glu 
245 250 255 

Asp Asp Asn Arg He Leu Arg Pro Trp Gin Ser Ser Gly Thr Arg Ser 
260 265 270 

He Thr Asn Thr Thr Val Cys Thr Lys Cys Gly Gly Ala Gly His He 
275 280 285 

Ala Ser Asp Cys Lys Phe Gin Arg Pro Gly Asp Pro Gin Ser Ala Gin 
290 295 300 

Asp Lys Ala Arg Met Asp Lys Glu Tyr Leu Ser Leu Met Ala Glu Leu 
305 310 315 320 

Gly Glu Ala Pro Val Pro Ala Ser Val Gly Ser Thr Ser Gly Pro Ala 
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Thr Pro Leu Ala 
340 



Ser Ala Pro 



Arg 
345 



Pro Ala 



Ala 
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350 
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Pro Pro Pro Pro Ser Leu Met Ser Thr Thr Gin Ser Arg Pro Pro Tip 
355 360 365 

Met Asn Ser Gly Pro Ser Glu Ser Trp Pro Tyr His Gly Met His Gly 
370 375 380 

Gly Gly Pro Gly Gly Pro Gly Gly Gly Pro His Ser Phe Pro His Pro 
385 390 395 400 

Leu Pro Ser Leu Thr Gly Gly His Gly Gly His Pro Met Gin His Asn 
405 410 415 

Pro Asn Gly Pro Pro Pro Pro Trp Met Gin Pro Pro Pro Pro Pro Met 
420 425 430 

Asn Gin Gly Pro His Pro Pro Gly His His Gly Pro Pro Pro Met Asp 
435 440 445 

Gin Tyr Leu Gly Ser Thr Pro Val Gly Ser Gly Val Tyr Arg Leu His 
450 455 460 

Gin Gly Lys Gly Met Met Pro Pro Pro Pro Met Gly Met Met Pro Pro 
465 470 475 480 

Pro Pro Pro Pro Pro Ser Gly Gin Pro Pro Pro Pro Pro Ser Gly Pro 
485 490 495 

Leu Pro Pro Trp Gin Gin Gin Gin Gin Gin Pro Pro Pro Pro Pro Pro 
500 505 510 

Pro Ser Ser Ser Met Ala Ser Ser Thr Pro Leu Pro Trp Gin Gin Asn 
515 520 525 

Thr Thr Thr Thr Thr Thr Ser Ala Gly Thr Gly Ser lie Pro Pro Trp 
530 535 540 

Gin Gin Gin Gin Ala Ala Ala Ala Ala Ser Pro Gly Ala Pro Gin Met 
545 550 555 560 

Gin Gly Asn Pro Thr Met Val Pro Leu Pro Pro Gly Val Gin Pro Pro 
565 570 575 

Leu Pro Pro Gly Ala Pro Pro Pro Pro Pro Arg Ser lie Glu Cys Leu 
580 585 590 

Leu Cys Leu Leu Ser Leu Leu Thr Gin Leu Pro Leu Pro Leu Pro Lys 
595 600 605 . 

Pro Gly Arg Gin Asp Pro Ser Pro Arg Arg Arg Trp Pro Glu Pro 
610 615 620 



Claims 



1 . A DNA comprising the full structure or a part of the DNA represented by SEQ ID NO:1 or a DNA essentially equal 
to the DNA comprising the full structure or a part of the DNA represented by SEQ ID NO:1 . 

2. A polypeptide comprising the full structure or a part of the polypeptide coded for by the DNA represented by SEQ 
ID NO:1 or a polypeptide essentially equal to the polypeptide comprising the full structure or a part of a polypeptide 
coded for by the DNA represented by SEQ ID NO:1. 
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3. A transformant transformed by a plasmid having, integrated therein, the full structure or a part of the DNA repre- 
sented by SEQ ID NO: 1 which can be expressed therein. 

4. An antibody against the polypeptide as set forth in Claim 2 as an antigen. 

5. A gene analysis method which comprises using, as a primer, a probe or a marker, a DNA comprising a part of the 
DNA represented by SEQ ID NO:1 and hybridizing the primer, the probe or the marker with a DNA to be tested. 
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